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WITH  FORKS  AND  HOPE 


Explanation  and  Apologia 


This  preface  is  written  with  a  profound  and  humble  apology 
to  those  five  or  ten  readers  of  this  paper  who  already  understand 
both  the  source  and  aptness  of  the  major  title.  A  Random  Serial 
Search  of  40  Documentalists  in  Philadelphia  found  an  incidence  of 
^/40  (the  numerator  representing  the,  obviously  better,  half  of  a 
Documentalist);  a  Simultaneous  Parallel  Search  of  an  audience  of 
Electronic  Information  Handlers  in  Pittsburgh,  employing  the 
accepted  "Is  there  a  Carrollite  in  the  House?"  technique  found 
Z/400  (and  one  of  those  cheated,  as  I'd  explained  it  to  him  the 
night  before)  who  recognized  the  source.  My  estimate  of  the 
logical  intersection  of  the  two  classes  is  probably  wildly  optimistic. 
Hence  this  explanation. 

The  phrase  occurs,  as  all  students  of  the  writings  of  Charles 
Lutwidge  Dodgson  (Cantabriggian  mathematician  (1832-  1898))  know, 
passim  in  "The  Hunting  of  the  Snark.  "  A  proper  KPIC  (Key  Phrase 
in  Context)  system  would  show  the  following: 

"You  may  seek  it  with  thimbles  and  seek  it  with  care. 
You  may  hunt  it  with  forks  and  hope 
You  may  threaten  its  life  with  a  railway  share 
You  may  charm  it  with  smiles  and  soap." 

If  one  takes  advantage  of  the  ambiguity  of  "it,"  and  substitutes 
the  "Long  Range  Goals  of  Basic  Research"  for  "Snark"  (carefullyr 
and  deliberately  ignoring  the  problem  of  the  Boojum),  the  need  for 
"Hope"  becomes  obvious. 

"Forks,"  in  this  context,  can  net  be  clarified  without  resort 
to  the  ikons.  In  the  illustration  accompanying  the  1914  edition,  it 
becomes  clear  that  at  least  three  separate  sorts  of  forks  are 
implied.  One  is  a  trident,  standard  Retarius  Mk  1(a)  mode  for 
pinning  the  prey.  Another  is  z  two-pronged  agricultural  implement, 
suitable  for  short-range  prey  transport  and  termination.  The  third, 
representing  the  using  commands,  is  a  smaller,  also  two-tined, 
carving  fork.  Other  necessary  implements,  illustrated  in  the  ikon 
although  not  in  the  text,  are  a  microscope  and  telescope. 

In  summary  then,  if  one  is  pursuing  basic  research  one 
should  do  so  with  both  hope  and  forks. 


WITH  FORKS  AND  HOPE 


INFORMATION  TECHNOLOGY  AND  THE  INFORMATION  SCIENCES 


It  seems  appropriate  in  a  university  ambience  to  begin  with  a 
historical  anecdote  --  one  of  the  very  earliest  instances  1  have  been 
able  to  find  of  the  relations  between  academic  research,  military 
applications,  and  the  government  --  the  story  of  Galileo  and  the 
telescope.  I  am  indebted  to  the  Oxford  "History  of  Technology"  and 
to  Arthur  Koestler  in  The  Sleepwalkers  for  this  information. 

Galileo  did  not  invent  the  telescope,  but  he  probably  made  more 
money  from  it  than  the  man  who  did.  According  to  a  reliable  record 
of  1634,  Johannes  Janssen  or  Jansen,  son  of  the  Dutch  spectacle 
maker  who  probably  did,  declared  that  his  father  "made  the  first 
telescope  amongst  us  in  1604,  after  the  model  of  an  Italian  one,  on 
which  was  written  anno  1590."  Giambattista  della  Porta  of  Naples 
(1536-1605)  describes  in  the  second  edition  of  his  Magiae  Naturalis 
(  1589)  various  ways  of  improving  vision  at  a  distance ,  including  tbe 
use  of  a  convex  and  concave  lens. 

Galileo  may  or  may  not  have  seen  one  of  the  Dutch  telescopes. 
He  claimed  (in  The  Messenger  from  the  Stars)  that  he  had  merely 
read  reports  (from  !DDC  -  the  Dutch  Documentation  Center  ? )  of  the 
invention,  and  that  these  reports  ha^”  stimulated  him  to  construct  an 
instrument  on  the  same  principle,  which  he  had  only  succeeded  in 
doing  through  extensive  basic  research  in  "the  principle  of  refraction. 
This  may  or  may  not  have  been  a  snow  job  --it  certainly  didn't  take 
the  mind  of  Galileo  to  put  a  concave  and  a  convex  spectacle  lens 
together  once  you  knew  that  it  could  be  done. 

Be  that  as  it  may.  Galileo  proceeded  to  make  a  presentation 
and  demonstration  to  the  Venetian  Senate  on  the  tower  of  Saint  Marco 
on  8  Au  ^st  1609,  Three  days  later  he  gave  the  instrument  to  the 
Senate,  together  with  a  Technical  Manual  cum  brochure  explaining 
that  this  instrument,  which  magnified  nine  times,  would  prove  of 
utmost  importance  in  war  since  it  made  it  possible  to  "see  sails  and 
shipping  that  were  so  far  off  that  it  was  two  hours  before  they  were 
seen  with  the  naked  eye,  steering  full  sail  into  the  harbour  ";  thus 
being  invaluable  against  invasion  by  sea. 

Koestler  adds,  in  a  sentence  I  tend  to  use  in  my  more  paranoid 
Pentagon  briefings; 

"It  was  not  the  first  nor  the  last  time  that  pure  research 
that  starved  cur,  snapped  up  a  bone  from  the  warlords'  rich 
banquet .  " 


The  story  does  not  end  there.  Galileo  gave  the  telescope  to 
the  Senate;  the  grateful  Senate  in  return  doubled  his  salary  to  a 
thousand  scudi  a  year,  and  gave  him  tenure  in  his  professorship  at 
the  University  of  Padua,  which  belonged  to  the  Republic  of  Venice. 

I  am  not  entirely  sure  what  the  moral  or  morals  of  this  story 
is/are.  If  the  Senate  had  issued  RFP's  to  meet  their  Military 
Requirement  for  an  improved  Command  and  Control  System,  their 
proposal  evaluation  might  have  reflected  the  needs  of  the  service 
which  opened  the  proposals.  I  can  imagine  that  aerial  types  would 
have  put  in  for  a  fire  tower  on  top  of  the  Tower  of  San  Marco,  that 
aquatic  types  might  have  preferred  a  fleet  of  picket  boats;  and  that 
those  with  more  terrestrial  proclivities  would  have  asked  for  a 
doubled  appropriation  for  coast  artillery,  on  the  theory  that  more 
and  bigger  guns  could  take  care  of  any  problem. 

Like  all  good  stories,  this  has  a  happy  ending.  The  military 
got  a  solution  to  their  problem  that  would  never  have  turned  up 
through  normal  development  channels.  And  Galileo,  rewarded  for 
Keeping  Up  With  The  Technical  Literature  and  seeing  an  Immediate 
Practical  Application,  went  on  to  build  better  telescopes  and  actually 
do  good  basic  research  in  astronomy. 

The  ostensive,  if  not  ostentatious,  point  of  beginning  with  a 
hidden  passage  in  the  history  of  Galileo  and  the  telescope,  may 
become  clearer  with  the  following  definition: 

Electronic  information  handling,  the  subject  of  this  meeting, 
is  a  rapidly  developing  technology.  It  is  parasitic  upon,  symbiotic 
with,  and  host  to  all  other  technologies.  Like  all  other  technologies, 
it  is  dependent  upon  a  body  of  fundamental  scientific  disciplines  and 
knowledge.  Advances  in  information  technology  can  only  come  in 
three  ways;  by  specific  research  and  development  efforts  aimed  at 
information  handling  per  se;  by  exploiting  the  fortuitous  advances 
in  ancillary  technologies;  and,  by  improvements  in  fundamental 
scientific  knowledge  and  understanding. 

The  invention,  or  continued  re-invention,  of  coordinate 
indexing  is  an  example  of  the  first;  the  continuing  improvements  in 
computers  designed  for  either  business  or  mathematics  of  the  second; 
and,  perhaps,  the  epistemological  battle  now  being  waged  between 
syntax  and  semantics  of  the  third. 

More  than  most  technologies,  with  the  possible  exception  of 
medicine  which  it  curiously  resembles,  information  handling  is 
involved  with  people  as  producers,  processors,  and  consumers  of 
information. 

Most  technologies  can  get  along  very  nicely  without  people, 
in  fact,  much  of  their  engineering  ei  ort  is  devoted  to  protecting 


their  systems  from  people.  A  little  old  lady  in  tennis  shoes  can  do 
more  damage  to  a  car  in  a  hundred  miles  driving  to  and  fro  through 
the  Liberty  Tubes  than  a  lead-footed  test  driver  will  do  in  1, 000 
miles  on  the  proving  ground;  whether  rightly  or  wrongly,  most 
aircraft  accidents  are  attributed  to  pilot  error,  and  the  majority  of 
automobile  accidents  happen  to  cars  in  excellent  mechanical 
condition.  One  can  build  foolproof  machinery,  but  there  is  no  such 
thing  as  a  people-proof  information  system. 

Let  me  talk  about  the  problems  of  people  as  producers  of 
information.  Last  February  in  Bangalore  I  mot  a  young  British 
engineer,  who  had  been  sent  out  to  India  to  manage  a  Horlick's 
malted  milk  factory.  After  the  third  gin  and  tonic  (the  first  two 
were  spent  in  discussing,  seriatim,  King  George  III  and  the  relative 
merits  of  the  European  four-wheel  drift  vs.  the  American  power 
broadslide  as  a  way  of  getting  around  corners),  ho  began  to  speak 
enviously  of  the  American  milkshed  system  where  the  manager  of 
a  factory  like  his  could  count  on  tank  trucks  of  pure  milk  pulling  up 
to  the  loading  bay  on  regular  schedules. 

In  India,  it  turns  out,  each  cow  is  owned  by  an  individual  who 
gets  up  before  daw'n,  milks  it  into  a  little  tin  pail  with  a  lid,  ties 
the  pail  on  the  back  of  his  tall  black  bicycle,  and  wobbles  precariously 
down  the  middle  of  the  road  for  10  miles  to  the  factory.  There  he 
exchanges  his  full  pail  for  a  sterilized  empty  one,  rides  10  miles 
back  to  his  village  and  promptly  washes  out  the  pail  under  the  village 
pump. 


Most  of  us  who  run  information  systems  would  like  to  he  in 
the  position  of  the  American  dairy  manager,  with  large  amounts  of 
pure  reliable  material  arriving  promptly.  We  actually  find  ourselves 
in  the  position  of  the  Indian  dairy  manager,  with  milk  that  may  never 
get  in  the  pails  and/or  be  consumed  in  th**  village  (I  am  reminded, 
somehow',  of  Mark  Tv.ain's  village  that  lived  by  taking  in  each  other's 
w’ashing),  or  gets  spilled  or  turns  sour  en  route  to  our  factory,  dealing 
with  producers  far  more  anarchic  than  the  Indian  cow  ow'ner,  with 
far  feebler  incentives  to  encourage  deUvery  at  the  factory  docks. 

We  need  people  to  run  our  systems  --  trained,  skilled, 
intelligent,  creative  people  who  will  neither  be  bored  by  routine 
nor  become  too  inventive  in  their  indexing,  much  as  we  w’ould  like 
to  automate  them  out  of  our  stacks,  our  accessions  departments, 
our  cataloging  rooms  and  our  reference  desks. 

Most  of  all  we  need  people  as  customers.  We  can  not  live 
solely  by  talking  to  other  information  centers  and  to  our  Federal 
sponsors.  There  comes  a  time  when  people  must  use  our  products. 
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Ranganathan  can  talk  of  "Every  reader  his  book",  T ime  can 
talk  of  "Every  non-reader  his  non-book."  We  mvjst  deal  with 
carnivores,  who  want  only  small  amounts  of  highly  concentrated 
information  and  turn  savage  if  not  cannibalistic  when  they  don't  get 
it;  with  placid  herbivores,  who  are  willing  to  munch  vast  heaps  of 
cellulose  to  extract  a  minimum  of  nutrition,  and,  with  the  vast  run 
of  omnivores,  who,  in  spite  of  their  innate  ability  to  digest  almost 
everything,  have  developed  sophisticated,  jaded  or  even  perverted 
appetites . 

I  will  now  return  to  the  specific  and  implied  subject  of  this 
talk  --  research  needed  for  the  improvement  o.  information 
technology.  You  will  remember  that  I  said  that  this  improvement 
could  come  in  only  three  ways: 

1.  By  specific  research  and  development  in  information 
handling  per  se. 

2.  By  exploiting  the  fortuitous  advances  in  ancillary 
technologies . 

3.  By  improvements  in  fundamental  scientific  knowledge  and 
understanding . 

Let  me  speak  of  the  easiest  part  first - 

By  exploiting  the  fortuitous  advances  in  ancillary  technologies  - 

Information  handling,  at  least  in  the  very  strict  sense  as  it 
applies  to  the  handling  of  scientific  and  technical  information,  is 
not  likely  to  be  a  major  customer  for  many  large  new  equipments. 
A  certain  inherent  reluctance  to  talk  about  rope  in  the  house  of  one 
who  lost  an  ancestor  when  the  platform  gave  way  while  he  was 
attending  a  public  function  keeps  me  from  mentioning  the  fate  of 
the  last  computer  to  be  designed  specifically  for  information 
retrieval  --  nevertheless,  computers  have  be'^n  getting  bigger  and 
better,  faster  and  cheaper  every  year.  We  might  v/ell  be  using  the 
Indian  pattern  of  Leicas  for  microfilming  and  studio  enlargers  for 
making  photo  copies  if  there  were  not  a  major  business  market 
for  microfilming  checks  and  industrial  records. 

I  am  not  at  all  sure  that  equipment  manufacturers  always 
understand  this  aspect  of  the  information  retrieval  market. 

People  do  occasionally  buy  Rolls  Royces,  Pegasos,  Ferraris  and 
Walnuts,  but  most  uf  us  are  in  the  position  of  borrowing  time  on 
someone  else's  Chevrolet. 

Perhaps  an  analogy  from  another  tieid,  that  of  mechanical 
translation,  will  make  my  attitude  clearer,  1  vas  visited  recently 
by  a  representative  from  a  small  software  firm  which  had  sunk 
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(I  refuse  to  use  the  word  invested)  $500,  000  of  corporate  funds  into 
a  mechanical  translation  program. 

I  said,  "How'  do  you  justify  this  to  your  stockholders?" 

"What  do  you  mean?" 

"Look,  DOD  has  said  somewhere  that  they  need  about 
60  million  words  of  Russian  text  translated  a  ytar.  You  know  damn 
well  that  we  can  buy  fair-to- middling  human  translation  at  20  bucks 
a  thousand  words,  and  probably  wouldn't  be  interested  in  machine 
translation  unless  we  could  get  it  considerably  cheaper  --  say 
10  bucks  a  thousand.  Assuming  that  a  contract  was  let  for  this, 
and  assuming  that  you  were  the  successful  bidder,  this  would  give 
you  a  gross  of  $600,  000  a  year  and,  at  10%  profit,  a  net  of  $60,  000, 
Are  you  sure  that  you  want  to  be  in  this  game'’" 

Or,  to  switch  to  another  field,  a  recent  report  on  the  mechaniza¬ 
tion  of  the  Library  of  Congress  set  a  price  tag  of  $30  million  for  the 
minimum  automation  of  the  central  bibliographic  system.  John  Walsh, 
in  one  of  his  quasi- editorials  in  Science  (143,  452-455,  1964)  doubted 
seriously  that  the  Congress  would  ever  appropriate  the  money  to  do 
this  job. 

Yet,  Missiles  and  Rockets,  in  a  recent  survey  of  display 
systems  for  command  and  control  (5  October,  1964)  estimates  in  a 
matter  of  fact  way  that: 

"Command  and  control  system  displays,  on  the  order 

of  $1  million  each,  are  expected  to  continue  at  the  rate  of 

25-30  a  year  for  at  least  5  --  10  years." 

It  is  .1  lot  cheaper  to  make  a  Bookmobile  out  of  a  commercial 
bus  than  to  start  from  scratch.  Most  of  us,  when  it  comes  to  major 
capital  equipment,  are  going  to  find  ourselves  on  the  winning  end  of 
the  game  that  the  Government  Printing  Office  plays  with  me  every 
time  I  send  a  book  over  for  printing  --  they  let  me  pay  for  the  costs 
of  setting  and  printing  the  first  4,  000  copies  and  then  charge  them¬ 
selves  only  the  incremental  costs  for  any  additional  copies  they  want. 
We  can  let  the  equipment  be  developed  and  paic  for  by  someone  else, 
and  then  modify  and/or  borrow  it  for  our  own  purposes,  rather  than 
pay  all  the  research  and  development  costs  for  the  first  prototype. 

Much  of  research  arid  development  in  information  handling  per  se 
seems  to  me  to  be  deficient  in  at  least  three  aspects: 

1.  The  absence  of  exciting  new  ideas. 

2.  The  test  of  the  market  place. 
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3.  Clear-cut  proof  to  the  complete  satisfaction  of  the  shirt 
sleeve  scientist,  the  grey  eminences  of  the  invisible 
colleges,  and  thoje  concerned  with  the  disbursement  of 
public  funds,  in  both  the  Legislative  and  Executive 
branches  of  the  Government,  that  the  job  we  are  trying 
to  do  is  socially  beneficial  rather  than  socially  harmless. 

(I  refuse,  even  for  the  sake  of  symmetry,  to  admit  the 
third  possibility.) 

It  is  difficult,  at  least  in  serial  speech,  to  discuss  these  three 
separately.  One  must  be  closely  linked  with  two,  lest  we  wind  up 
with  hand- set  letter  press  Selective  Dissemination  of  Information 
systems,  or  nation-wide  microwave  color  television  links  between 
laboratories,  turning  on  automatically  with  the  laboratory  lights, 
with  all  messages  going  automatically  on  videotape  into  a  central 
file  dwarfing  anything  that  any  dreamer  of  national  information 
systems  has  yet  conceived. 

Two  and  three  have  equally  close  links,  against  the  day  when  the 
full  national  expenditures  on  scientific  and  technical  information  are 
finally  dragged  out  from  under  all  their  ingenious  covers  and  some 
cold-eyed  gentleman  says  "O.K.  This  is  what  you're  spending.  What 
are  you  getting  for  it?" 

To  return  to  my  first  point.  Six  months  ago  I  spoke  in  this  same 
hotel  on  the  problems  of  scientific  creativity  under  the  title 
"The  Scientist,  The  Engineer,  The  Inventor  --  One  World  or  Three.  " 
We  are  slowly  training  a  competent  body  of  information  engineers  -- 
people  who  can  apply  known  principles  cleverly  and  skillfully  to  the 
solution  of  specified  problems.  Scientists,  as  I  shall  point  out  later 
in  my  talk,  are  being  attracted  to  the  field  in  growing  numbers  even 
though,  under  my  operating  slogan  of  "Sic  vos  non  vobis  mellificatis , 
apes  --  thus  you  bees  make  honey,  but  not  for  yourselves  alone," 
they  may  not  realize  that  that  is  what  is  happening.  But  we're  running 
short  of  inventors. 

This  Wednesday,  at  the  banquet  of  the  American  Documenf’tion 
Institute,  a  moving  tribute  was  paid  to  the  memory  of  a  gentleman 
whom  I  would  hope  considered  me  a  friend  --  Hans  Peter  Luhn. 

I  have  never  made  an  exhaustive  search  of  all  of  Pete's  contributions 
to  our  field,  but  let  me  just  mention  three  which  have  crossed  my 
rather  high  threshold  --  Selective  Dissemination  of  Information; 

Key  Word  in  Context  Indexing  and  Auto-abstracting.  For  years  now 
much  of  the  traffic  in  my  office  has  been  with  people  who  would  say, 
"Yes,  I  know  Pete  invented  this  techniqr^,  but  I  can  improve  on  it." 

It  is  not  difficult  to  improve  on  someone  else's  invention  --  Steve 
Juhasz,  Ed  Rippberger  and  I  have  been,  we  hope,  guilty  of  it  with 
WADEX  --  but  it  is  difficult,  and  for  most  people  impossible,  to  make 
an  invention  of  your  own.  It  is  even  more  difficxilt  for  an  invention  to 
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meet,  as  have,  at  least  two  of  Pete's  --  SDI  and  KWIC  --  tlu- 

test  of  the  market  place.  I  do  not  know  w'here  w^e  will  ever  find  more 

people  like  Pete  Luhn,  but  the  field  certainly  needs  them. 

I  am  not  sure  that  my  job  description  calls  for  me  to  be  either 
inventive  or  creative;  one  of  the  prices  of  becoming  an  administrator 
is  to  decline  the  fame  and  envy  of  original  composition,  but  there  are 
two  notions  that  I've  been  gnawing  on  for  a  while. 

One  is  the  need  for  a  scaling  factor  for  information  systems. 

I  hinted  at  this  in  my  most  unrequested  reprint  --  J.  Chemical 
Documentation,  3^,  Z16,  1963  --  where  I  voiced  my  suspicion  that  the 
square-cube  law  --  that  as  an  organism,  grows,  its  surface  increases 
as  the  square  of  the  diameter,  while  the  internal  volume,  and  mass, 
increase  as  the  cube  --  that  affects  all  living  organisms  also  applies 
to  information  systems.  I  feel  inaiitively,  but  lack  both  tha  evidence 
and  the  mathematics  to  prove,  that  the  surface  area  of  an  information 
system  available  for  radiation  --  the  transfer  of  informiation  outside 
the  system  --  increases  at  a  slower  rate  than  the  complexities  of 
interaction  between  the  items  in  the  store,  and  that  both  of  these  tend 
to  grow  far  more  rapidly  than  does  the  nutrient  supply  of  people  and 
money  needed  to  operate  the  system. 

An  interesting  consequence  of  the  square-cube  law  in  nature  is 
that  it  sets  both  a  lower  limit  --  something  the  size  of  a  shrew  has 
to  spend  all  its  time  eating  lest  it  starve  to  death  --  and  an  upper 
limit  to  the  size  of  organisms.  You  just  don’t  build  a  land-based  animal 
much  larger  than  the  elephant. 

I  wonder  if  this  square- cube  law  may  not  also  set  up  an  upper 
limit  to  the  size  of  information  systems;  if  the  internal  complexities 
are  growing  at  a  much  faster  rate  than  the  public  contact  area,  the 
manager  inevitably  becomes  more  concerned  with  the  internal 
management  than  with  the  public  service  .and,  inevitably,  gets  a  key 
to  the  dinosaur  club. 

I  wonder  also  if  we  have  not  been  remiss  in  forgetting  that  there 
are,  after  all,  four  laws  of  thermodynamics  in  our  concentration  on 
the  second,  I  can't  do  anything  constructive  with  the  first .  I  started 
thinking  about  the  third  when  I  started  thinking  about  the  entropy  of 
kr-owledge  --  that  sub- set  of  information  which  gets  inside  the  skull 
and  stays  there  long  enough  to  do  some  good  --  and  think  that  I  could 
do  something  about  that  in  relation  to  Boring's  minimum  set  of  dissonant 
paradigms  by  which  we  actually  operate. 

I  do  think,  though,  that  we  need  something  like  the  zeroth  law 
of  thermodynamics.  Thermodynamics  operates  on  the  assumption, 
amply  corroborated  by  experimental  evidence,  that  heat  flows  from 
hot  bodies  to  colder  ones,  and  never  in  the  reverse  direction;  that 
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heat  flows  from  heat  sources  to  heat  sinks.  It  was  many  years  before 
that  they  realized  that  they  needed  one  more  law,  the  zeroth  law  -- 
that  when  two  bodies  are  in  thermal  equilibrium  no  heat  flows  from 
the  one  to  the  other  --to  provide  a  logical  axiomatic  basis  for  the 

Other  three . 

We  operate,  I  submit,  on  the  assumption  that  information 
invariably  flows  from  information  sources  to  information  sinks.  Is  this 
a  safe  assumption?  Has  anyone  ever  proven  it,  either  theoretically  or 
empirically? 

Let  me  ’'eturn  to  my  points  two  and  three.  We  are  not  practicing 
a  branch  of  aesthetics  v/here  we  can  concern  ourselves  with  art  for 
art's  sake.  We  are  dealing  with  the  engineering  of  systems  to  do  a 
variety  of  jobs,  not  least  of  which  is  satisfying  both  our  customers  and 
our  sponsors.  We  think  we  know,  although  we  probably  do  not,  a  great 
deal  about  our  milieu  int«»rnale.  What  do  we  know  about  our  milieu 
externale? 

What  do  we  know  about  how  scientists  and  engineers  now 
communicate  and  use  information? 

What  do  we  know  about  the  relation  of  information  to  the  actual 
processes  of  scientific  research,  of  engineering  development,  of 

Invention? 

Just  what  is  it  that  information  and  information  services  actually 

do? 


What  sort  of  accepted  (and  acceptable)  methods  and  criteria  can 
be  used  for  evaluating  objectively  the  design  and  operation  of  informa¬ 
tion  systems  and,  perhapr  most  important  of  all,  their  actual  and 
potential  utilities. 

Or,  to  use  a  phras°  which  some  of  you  must  have  heard  before, 
how  do  you  do  a  cost -effectiveness  study  on  an  information  system’ 

I  would  be  less  th?  jracious  if  1  did  not  call  the  attention  of 
those  seeking  problems  on  which  to  do  research  to  the  prospectus  of 
the  Knowledge  Availability  Systems  Center  which,  at  least  in  the  draft 
I  have  (dated  1  August  1963),  outlines  some  Z9  more  or  less  separate 
problems  under  such  general  headings  as: 

Criteria  for  systems  design 
Comparative  anatomy  of  systems 
La.nguage  manipulation 
Behavioral  studies 
Hardware  studies 
Media  stuHies 
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At  least  a  third  o*  these  studies  la.,  into  tl  third  and  last  area 
I  wish  to  discuss  today,  bas-v  rj'sean  h  in  the  underlying  scientiiic 
disciplines  --  the  third  \Kay  m  wh.ch  I  said  improvements  in  information 
technology  could  come  about.  This  is  not  a  field  for  one  who  expects 
quick  results,  nor  immediate  a  pp’ ications,  nor,  for  that  matter,  is  it 
a  field  for  crash  programs.  I  am  rather  amused  by  the  plaint  of  a 
former  principal  investigator  of  mine,  who  once  did  good  basic  research 
for  me  and  now  finds  h.msoil  operating  a  multi  million  dollar  informa¬ 
tion  center,  that  there  vs  ..ttie  i  omir.g  out  of  any  of  the  three  major 
basic  research  programs  in  th.s  ueld  fthe  classification  is  by  sponsoring 
agency)  that  helps  him  with  .^’s  practica.  operating  problems 

Of  course  not.  Those  ol  us  who  ha\e  been  administering  basic 
research  programs  .n  this  :..e’d  'ao*.  d  be  dereUit  in  onr  duty  if  we 
yielded  to  our  chronic  temptation  and  cooked  our  seed  corn  --  sought 
the  approbation  of  our  bosses  by  b^ving  research  on  the  basis  of  its 
immediate  applications. 

Our  job  in  managing  basu  research  iS  to  bet  on  'ong  shots  at  the 
$2  window.  We  try  to  do  .s  on  a  .itt.e  more  rational  basis  than  the 
horses’  names  or  tlie  color  of  the  jockeys'  eves  •  aithojgh  I  must 
admit  that  we  do  pay  a  ittie  a’tent’on  to  the  <  c  or  oi  the  jockeys'  silks, 
especially  if  they  are  those  ci  a  major  stable.  A  horse  -  playing  former 
chief  scienl'st  of  onrs  o.nce  sa.dtlat  o  .r  job  was  looking  for  overlays  -- 
cases  where  the  true  odds  are  better  than  the  apparent  odds.  Other 
agencies  have  much  larger  s,.ms  to  bet  on  tavontes  to  wun,  place  or 
show,  at  corresponding  y  lower  odds,  favorites  do  drop  dead  in  the 
stretch;  long  shots  do  come  i  rom  beh.nd  to  win.  Tfus,  together  with 
the  traditional  difference  in  op.mon,  is  wnat  makes  horse  playing, 
and  the  administration  ol  a  b,is:<  resear,  h  program,  a  sporting  game. 

Where  does  one  go  ioo.i-.ng  for  researih  workers  .vho  might  be 
able  to  take  solid  steps  towards  solv.ng  this  problem?  (In  much  that 
follows,  I  might  qiute  proper./  be  acc..sed  of  exeri^ising  the  droite  du 
seigneur  on  a  report,  "Information  Processing  Heievart  to  Military 
Command:  Survey,  Re-  cmmerdat  ions  and  B  biiography,  '  prepared 
by  A,  E.  Murray  and  11,  R.  Lcland  o:  Cornei.  Ac  rona  Jt  ica  1  Laboratory 
under  Contrawt  AF  19(b2b',  162S  for  the  .System  Design  L.aboratory, 
Electronic  Systems  Div.sion,  Air  Font  Systems  Command.  ESD 
TDR-63-349.)  Somet  .mes,  b  ‘  o'^^.v  scmet  .rnes,  .r.  .'■t  hot  is  of 
documentation  and/cr  ,'.l)rar  ,;nd/or  ’-iicrmat  on  «•.  lence.  They  are 
likely  to  be  sc  attered  a  .  over  t|,e  i.n . \  e r s  ,  .amp.s,  iiot  .nlrequently 
in  the  Elect  Tital  Eng.ueer.'g  ciepa  rt me’’ i  wn  .  'i  ^as  ficucme  the 
Liberal  Arts  cotiege  c  eng'i  ccr.ng),  b  •  a  so  n  s'  -  n  dep.artments  as 
biophysics,  ph.iosopl'  ps  c  vuogv  or  mat>  emat  .  s.  Some  are  not 
even  on  univers'.ty  tamp.ses  ,0  ah,  b.  i  h.dden  awav  remote  corners 
of  great  industria;  researt  ■  nborator.es  or  ^  sma  R^■D  i.rms  in 
deserted  shopping  i  enters. 
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If  you  ask  them  what  they  are  working  om  they  are  unlikely  to 
answer,  unle»8  they  have  been  corrupted  by  the  thought  of  government 
funding,  by  such  phrases  as  "Information  storage  and  retrieval"  or 
"Electronic  Information  handling."  They  are  fa-  more  likely  to 
answer  with  such  phrases  (or  deecriptors)  as; 

Automata,  especially  logical  or  computing  automata 

Pattern  recognition 

Signal  detection 

Artificial  intelligence,  n  echanization  of  thought  processes, 
brain  mechanisms,  artificial  organisms,  cognitive 
processes 

Bionics 

Self-organizing  systems 
Cybernetics 
Nerve  (or  neural)  nets 
Perception  mechanisms  and  logics 
Discriminating  functions 
Decision  making 

Problem  solving,  game  playing,  heuristic  programming, 
hill  climbing,  optimization,  linear  programming, 
dynamic  programming 

Linguistics 

Logic,  especially  multi-valued  and  modal  logics 

Information  theory,  channel  capacity,  entropy  and 
uncertainty,  coding  theory 

General  aspects  of  correlation,  prediction  and  filtering 

Control  theory,  servomechanisms,  theoretical  and 
experimental  dynamics  of  feedback  systems 

Signals  and  noise 

Psychology  of  value  judgments 
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Statistical  prediction  theory 

Vision,  speech  and  hearing 

Concept  and  percept  formation 

Network  and  switching  theory 

Speech  analysis,  synthesis,  and  recognition 

Existential  and  analytical  philosophy 

Epistemology 

Combinatorial  mathematics 
Random  processes 
Probability  theory 
Circuit  theory 
Cryptology  ^ 

Statistical  communications  theory 
Programming  languages 

Use  of  these  terms  as  descriptors  in  querying  several  very 
large  document  collections  produced  some  7,000  different  citations 
to  documents  ! 

The  odds  that  one  or  more  of  these  50  fields  or  7,000  documents 
may  yield  results  relevant  to  the  problems  of  electronic  information 
handling  may  seem  staggering,  but  I  submit  that  they  are  far  less 
than  the  odds  that  out  of  the  tens  of  thousands  of  young  men  and  women 
in  our  colleges  and  universities  will  come  another  Hans  Peter  Luhn. 

The  names  of  the  possible  fields  given  were  deliberately 
randomized.  A  rough  classification  --  remembering  that  all  classifi¬ 
cations  are  persona]  to  the  point  of  being  solipsistic  --  might  yield 
the  following  five  areas  which  seem  in  especial  need  of  encouragement 
and  acceleration. 

1.  The  link  between  language  and  epistemology  defines  the 
single  most  important  front  for  an  advance  in  information 
processing  technology.  Linguistics  occupies  a  uniquely 
pivotal  position  in  relation  to  various  aspects  of  intelligence 
and  automata.  Natural  language  breaches  the  interface 
between  conscious  reasoning  and  the  underlying  mechanisms 
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and  serves  as  the  medium  for  the  conscious  organization, 
transmission,  storage  and  retrieval  of  information. 

Formal  versions  link  machines  to  man's  will  and,  within 
the  machines,  primitive  formal  languages  govern  and  are  represented 
by  the  states,  transitions  and  interactions  of  the  active  parts.  To 
understand  the  nature  and  basis  of  intelligence  so  as  to  exploit  this 
understanding  in  the  use  and  development  of  automata,  we  need  to 
know  much  more  about  language.  Similarly,  to  understand  more 
fully  the  techniques  of  symbolizing  and  systematizing  meaning  or 
concepts  in  order  to  exploit  this  understanding  in  analysis,  storage, 
cross-linking,  searching  and  retrieval  of  information,  we,  again, 
need  to  know  much  more  about  language. 

2,  Well  conceived,  firmly  based  and  definitely,  purposefully, 
and  theoretically  oriented,  as  opposed  to  vague,  exploratory 
or  empirical,  research  is  needed  to  discover,  at  approxi¬ 
mately  the  "neural"  level,  plausible  fundamental  mechanisms 
for  the  development  of  intelligence  in  information  processing 
organisms  and  automata. 

The  problem  of  discovering  the  basic  of  intelligence  appears 
to  be  essentially  the  problem  of  elucidating  how  any  brain-like  system 
can,  through  contact  or  interaction  with  its  environment,  become 
functionally  organized  in  that  special  way  we  call  "intelligent." 

By  referring  this  investigation  to  the  "neural"  level,  one 
seeks  the  ultimate  mechaniomic  basis  of  intelligence  by  taking 
explicit  account  of  the  importance  of  the  nature,  characteristics  and 
interaction  of  relatively  simple  components  in  those  special 
aggregates  capable  of  acquiring  and  exhibiting  intelligence. 

3.  Both  philosophical  and  experimental  evidence  indicate 
that  a  satisfactory  explanation  or  mechanization  of  visual 
pattern  perception  must  incorporate  both  analytic  an3 
holistic  concepts.  Analytic  pattern  recognition,  without 
regard  for  the  problems  of  segmentation  of  a  complex 
visual  field,  and  suitable  only  for  clean,  separated 
figures,  is  receiving  most  of  the  attention  devoted  by 
physical  scientists  for  all  too  obvious  reasons. 

What  is  needed  more  is  much  more  difficult  to  supply; 
that  is,  irformation  and  understanding  on  the  interrelation  between 
the  analytic  and  Gestalt  aspects  of  pattern  recognition;  how  and 
what  subsets  of  point  stimuli  are  perceived  as  unitary  entities; 
figure-figure  and  figure-background  separation  mechanisms;  and 
the  meaning  of  the  direction  and  limitation  of  attention. 

This  example  has  been  set  in  the  field  of  visual  pattern 
perception.  Similar  and  probably  more  complex  problems  face  us 
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in  the  field  of  speech  perception,  which  may  serve  as  an  orbital 
stage  before  we  tackle  the  vastly  more  difficult  problem  of  semantic 
perception.  It  is  becoming  increasingly  clear  that  speecn 
recognition  can  not  be  done  on  the  basis  of  the  acoustic  properties 
of  the  speech  signal  alone;  that  general  solutions  w'ill  rely  upon  the 
interplay  of  linguistics  and  semantics. 

The  most  exciting  step  of  all  will  come  when  we  are  able 
to  study  pattern  recognition  in  text.  How  does  a  reader,  for  example, 
recognize  that  novel  A  has  the  same  plot  as  novel  B?  How  does  a 
scientist  realize  that  a  piece  of  work  in,  say,  psychoacoustics 
contains  the  clue  to  solving  his  problems  in  cloud  cover  analysis? 

And,  how  long  will  it  be  before  a  computer  will  actually  be  able  to 
take  a  document  and: 

Make  a  true  abstract? 

Recognize  that  it  is  related  to  work  not  cited  in  the 

bibliography? 

Describe  it  as  brilliant,  pedestrian,  or  unsound? 

Tell  the  plot  of  a  novel. 

4.  Self-organization  appears  to  be  a  basic  phenomenon 

manifested  in  the  greatest  variety  of  systems  which  can 
be  described  and  understood  in  terms  independent  of  the 
particular  system  in  which  it  is  observed.  One  of  our 
needs  is  for  research  which  studies  seif-organization  as 
the  central  phenomenon  of  any  system  or  systems,  and 
attempts  to  describe  it  in  the  most  b.  ic  and  general  of 
terms.  In  this  regard,  two  facts  are  noticeable: 

a.  While  learning  may  be  regarded  as  a  certain  kind  of 
self-organizing  capacity,  the  bulk  of  the  work  by 
nonbiologists  in  systems  which  "learn"  is  not 
directed  to  the  central  issue,  which  is  the  epistemo¬ 
logical  problem  for  automata. 

b.  The  principles  of  self-organization  in  fields  outside 
of  cognitive  systems  research  are  all  but  neglected 
by  interdisciplinarians. 

Some  attention  must  1  e  directed  to  self-organization  as 
manifested  in  the  most  central  phenomena  underlying  intelligence, 
and  to  the  possibility  of  generalizing  on  the  principles  of  self- 
organizalion  over  fields  as  remote  as  morphogenesis  and  socio¬ 
economics. 
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5.  It  has  become  apparent  in  recent  years  that  the  major 
breakthroughs  in  computer  capability  in  the  future  will 
come  from  improvements  in  the  logical  organization  of 
computers  and  in  new  programming  techniques  The 
organization  of  the  digital  computer  as  conceived  by 
von  Neumann  seems  increasingly  inadequate  to  the  types 
of  problems  people  actually  wish  to  solve.  Concepts  such 
as  associative  memory,  Vjuilt-in  stacks,  multiprocessing, 
multiprogramming  and  parallel  organization  represent  a 
radical  departure  from  traditional  ways  of  building 
computers,  quite  apart  from  the  harvfware  used.  At  the 
same  time,  the  difficulties  that  people  have  in  communicating 
problems  to  computers  have  become  more  and  more  pressing 
as  the  complexity  of  these  oroblems  has  grown. 

Areas  of  effort  most  likely  to  extend  the  capability  of  the 
digital  computer  include  machine  organization,  programming 
techniques  and  information-handling  techniques. 

The  problems  of  machine  organization  are  concerned  with 
ways  of  constructing  deterministic,  programmable  devices  that  can 
be  used  to  solve  problems.  Continuing  success  in  the  study  of 
relatively  large  complexes  of  relatively  simple  components  as  in 
distributed  element  computers  will  require,  either  for  its  own 
prosecuting  or  its  exploitation  in  useful  automata,  a  solution  to  the 
problems  of  space  consumption,  power  requirements  and  the  costs 
of  layout,  assembly  and  interconnection  of  the  components.  While 
microminiaturization  itself  probably  needs  no  further  encouragement, 
attention  to  the  comprehensive  solution  of  space,  power  and 
interconnection  problems  is  especially  recommended. 

Computers,  at  least  from  the  programmer's  view,  are 
mathematically  well-defined  structures  in  which  random  events  are 
virtually  nonexistent,  or  so  he  hopes.  Nevertheless,  although  a 
number  of  abstract  modelling  devices  for  machines,  such  as  finite 
state  machines  and  other  constructs  of  automata  theory,  do  exist, 
the  general  description  of  these  structures  has  never  been  fully 
formulated.  Such  a  formalism  could  provide  a  basis  for  a  compl  te 
yet  uniform  mode  of  machine  description  or,  more  pragmatically, 
could  also  serve  as  a  device  to  permit  automatic  generation  of 
programs  for  many  different  machines. 

Programming  techniques  are  concerned  with  ways  of 
applying  a  computing  engine  to  solve  many  different  unrelated  problems. 
Very  early  in  the  computer  game  it  became  recognized  that  machine 
language  was  not  a  particularly  efficient  way  of  posing  problems  to 
a  computer.  An  increasing  number  of  programming  demands  are 
being  met  by  problem-oriented  languages. 
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(Conversation  at  a  recent  Association  for  Computing 
Machinery  convention; 

"Hi,  Joe.  What's  nev/?" 

Joe,  proudly:  "I've  invented  a  new  programming  language." 

"So?  So  what  else  is  new?") 

One  question  concerns  the  way  in  which  such  languages  are 
described  --  a  crucial  question  because  of  the  increasing  need  for 
translators  for  these  languages.  Each  new  language  generates  a 
requirement  for  a  translator  for  many  existing  machines.  Formal, 
and  hence  machine  manipulatable,  descriptions  of  programming 
languages  are  therefore  increasingly  in  demand. 

Another  question  concerns  bridging  the  gap  between  human 
languages  and  programming  languages.  There  are  significant  structural 
differences  between  the  two.  Human  languages,  at  least  when  talking 
to  inferior  beings  like  children,  wives  and  computers,  are  constructed 
mainly  of  imperatives.  Most  of  the  work  in  developing  new  program¬ 
ming  language  has  been  concerned  with  their  local  structure  rather 
than  with  their  global  structure,  i.  e.  with  the  way  that  things  are 
said  rather  than  with  the  kinds  of  things  that  are  said.  Better  impedance 
matching  between  human  and  programming  languages  could  improve 
materially  the  ability  of  people,  even  trained  programmers,  to 
communicate  with  computers. 

Computer  programs  with  learning  ability  are  needed  -- 
some  way  to  use  the  computer  in  the  process  of  finding  problem¬ 
solving  algorithms  as  well  as  in  the  process  of  executing  these 
algorithms.  Human  beings  can  deal  with  complex  problems  only 
if  they  have  a  means  of  organizing  them;  computers  can  deal  with 
complexity  through  brute  force.  Problems  that  people  often  think 
of  as  ill  defined  are  really  problems  for  which  the  solution  algorithm 
is  too  complex  for  human  comprehension. 

In  such  circumstances,  a  man- machine  dialog,  at  a  slightly 
more  complex  level  than  "Me  Tarzan.  You  IBM"  must  be  created, 
with  the  machine  playing  a  more  active  role.  Tlie  machine  must 
learn  about  the  problem,  and  in  order  to  learn  it  must  be  able  to  ask 
questions. 
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L'ENVOI 


This  talk  has  covered  a  span  of  some  four  centuries,  from 
the  Magiae  Naturalis  of  Giambattista  d%lla  Porta  circa  1584  to  an 
Orweilian  world  of  dialectics  with  intelligent  computers  in  1984. 

There  are  two  things  that  I  hope  you  will  take  away  with  you 
from  this  talk. 

One  is  the  moral  (or  immoral)  of  the  story  of  Galileo  Galilei 
and  the  telescope  --  that  apart  from  moral,  legal  and  ethical 
considerations,  it  doesn't  really  matter  where  an  idea  comes  from 
if  you  can  figure  out  a  better  use  for  it. 

The  other  is  the  set  of  premises  on  which  this  talk  is  based: 

Electronic  information  handling  is  a  rapidly  developing 
technology.  It  is  parasitic  upon,  symbiotic  with,  and  host  to  all 
other  technologies.  Like  all  other  technologies,  it  is  dependent 
upon  a  body  of  fundamental  scientific  disciplines.  Advances  in 
information  technology  can  only  come  in  three  ways: 

By  specific  research  and  development  efforts  aimed  at 
information  handling  per  se; 

By  exploiting  the  fortuitous  advances  in  ancillary 
technologies; 

And,  by  improve  lents  in  fundamental  scientific  knowledge 
and  understanding. 

For,  after  all,  the  motto  of  my  organization,  the  Air  Force 
Office  of  Scientific  Research,  is  taken  from  Ecclesiasteses: 
Primum  acquirere  rognitionem  --  "First,  get  thee  understanding." 
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